Document Image Registration for Imposed Layer Extraction
نویسندگان
چکیده
Extraction of filled-in information from document images in the presence of template poses challenges due to geometrical distortion. Filled-in document image consists of null background, general information foreground and vital information imposed layer. Template document image consists of null background and general information foreground layer. In this paper a novel document image registration technique has been proposed to extract imposed layer from input document image. A convex polygon is constructed around the content of the input and the template image using convex hull. The vertices of the convex polygons of input and template are paired based on minimum Euclidean distance. Each vertex of the input convex polygon is subjected to transformation for the permutable combinations of rotation and scaling. Translation is handled by tight crop. For every transformation of the input vertices, Minimum Hausdorff distance (MHD) is computed. Minimum Hausdorff distance identifies the rotation and scaling values by which the input image should be transformed to align it to the template. Since transformation is an estimation process, the components in the input image do not overlay exactly on the components in the template, therefore connected component technique is applied to extract contour boxes at word level to identify partially overlapping components. Geometrical features such as density, area and degree of overlapping are extracted and compared between partially overlapping components to identify and eliminate components common to input image and template image. The residue constitutes imposed layer. Experimental results indicate the efficacy of the proposed model with computational complexity. Experiment has been conducted on variety of filled-in forms, applications and bank cheques. Data sets have been generated as test sets for comparative analysis.
منابع مشابه
Contourlet-Based Edge Extraction for Image Registration
Image registration is a crucial step in most image processing tasks for which the final result is achieved from a combination of various resources. In general, the majority of registration methods consist of the following four steps: feature extraction, feature matching, transform modeling, and finally image resampling. As the accuracy of a registration process is highly dependent to the fe...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملNovel Adaptive Filtering for Salt-and-Pepper Noise Removal from Binary Document Images
3D Meshes Registration: Application to Statistical Skull Model p. 100 Detection of Rib Borders on X-ray Chest Radiographs p. 108 Isosurface-Based Level Set Framework for MRA Segmentation p. 116 Segmentation of the Comet Assay Images p. 124 Automatic Extraction of the Retina AV Index p. 132 Image Registration in Electron Microscopy. A Stochastic Optimization Approach p. 141 Evolutionary Active C...
متن کاملReconnaissance et extraction de documents. Une application industrielle à la détection de documents semi-structurés
This article deals with the problem of recognition of semi-structured documents image. The aim is to detect a document and to extract the region of interest containing it. Initially, an exemple of document is given by the user and a set of interest points are extracted from this query image. In a second step, a set of interest points is extracted from each image to analyse and is matched with t...
متن کاملStrategies for promoting the Supervisory board Subject of Article 6 of the Registration Law Emphasizing the Transformation Document of the Judiciary
Abstract The Supervisory Board (Article 6 of the Law on the Registration of Deeds and Property) is the authority to deal with disputes and errors regarding the registration of documents and property. This reference lacks a procedure. The current method of handling this reference is incomplete and contrary to the policy of reducing the work of the court. If we want to make minor reforms in the ...
متن کامل